The Automatic Translation of Film Subtitles. A Machine Translation Success Story?
Abstract
Every so often one hears the complaint that 50 years of research in Machine Translation (MT) has not resulted in much progress, and that current MT systems are still unsatisfactory. A closer look, however, reveals that web-based general-purpose MT systems are used by thousands of users every day, and that special-purpose MT systems have long been in use and work successfully in particular domains or for specific companies. This paper investigates whether the automatic translation of film subtitles can be considered a machine translation success story. We describe various projects on MT of film subtitles and contrast them with our own project in this area. We argue that the text genre “film subtitles” is well suited for MT, in particular for Statistical MT.
But before we look at the translation of film subtitles, let us retrace some other MT success stories. Hutchins (1999) lists a number of successful MT systems. Amongst them is Météo, a system for translating Canadian weather reports between English and French, which is probably the most quoted MT system in practical use. References to Météo usually remind us that this is a “highly constrained sublanguage system”. On the other hand, there are general-purpose but customer-specific MT systems like the English to Spanish MT system at the Pan American Health Organization or the PaTrans system, which Hutchins (1999) calls “... possibly the best known success story for custom-built MT”. PaTrans was developed for LingTech A/S to translate English patents into Danish. Earlier, Whitelock and Kilby (1995, p. 198) had called the METAL system “a success story in the development of MT”. METAL is mentioned as “successfully used at a number of European companies” (by that time this meant a few dozen installations in industry, trade or banking).
During the same period the European Union successfully used a customized version of Systran for its translation service, and later also for online access by all its employees. Broad-coverage systems like METAL and Systran have always produced translations that required post-editing before publication.
Similar resources
Machine Translation of Film Subtitles from English to Spanish Combining a Statistical System with Rule-based Grammar
In this project we combined a statistical machine translation system for the translation of film subtitles from English to Spanish with rule-based grammar checking. At first we trained the best possible statistical machine translation system with the available training data. The largest part of the training corpus consists of freely available amateur subtitles. A smaller part are professionally...
Strategies Used in the Translation of Interlingual Subtitling
This study was an attempt to identify the interlingual strategies employed to translate English subtitles into Persian and to determine their frequency. Unlike in many countries, subtitling is a new field in Iran. The study, a corpus-based, comparative, descriptive, non-judgmental analysis of an English-Persian parallel corpus, comprised English audio scripts of five movies of differ...
Using Linguistic Annotations in Statistical Machine Translation of Film Subtitles
Statistical Machine Translation (SMT) has been successfully employed to support translation of film subtitles. We explore the integration of Constraint Grammar corpus annotations into a Swedish–Danish subtitle SMT system in the framework of factored SMT. While the usefulness of the annotations is limited with large amounts of parallel data, we show that linguistic annotations can increase the g...
The Correlation of Machine Translation Evaluation Metrics with Human Judgement on Persian Language
Machine Translation Evaluation Metrics (MTEMs) are central to Machine Translation (MT) engines, as these are developed on the basis of frequent evaluation. Although MTEMs are widespread today, their validity and quality for many languages are still in question. The aim of this research study was to examine the validity and assess the quality of MTEMs from Lexical Similarity set on machine tra...
Lexical Translation in Movies: A Comparative Analysis of Persian Dubs and Subtitles through CDA
The impact of translated movies has already been emphasized by quite a number of researchers. The present study aimed to investigate the strategies of the Iranian subtitlers and dubbers of English movies in rendering English words. To this aim, three theoretical frameworks were employed: The researchers first used the strategies proposed by Venuti (2012) and Newmark's (1991) classification and ...
Journal: JLCL
Volume: 24
Publication date: 2009